Speaker recognition performance under ideal-knowledge noise suppression: an investigation

Authors

  • Nilesh Madhu
  • Sung-Kyo Jung
Abstract

Speaker recognition in mobile devices suffers from poor performance in noisy environments, necessitating the use of noise suppression algorithms. These typically apply time-frequency masks, optimised on the signal statistics, to the noisy signal spectrum, suppressing the noise components while preserving the speech. Studies in the field of speech recognition demonstrate that ideal time-frequency masks (i.e. masks generated from ideal knowledge of the speech and noise spectra) improve the recognition rate even at very poor signal-to-noise ratios (SNRs). The effects of such masking on the performance of speaker recognition systems are studied here, to gain a better understanding of the pre-processing that is beneficial for automated speaker recognition. Two masking approaches are considered: the ideal binary mask and the ideal Wiener filter. We demonstrate that such ideal noise suppression significantly improves the recognition rate over the unprocessed system. As any noise suppression algorithm involves a trade-off between noise modulation and speech attenuation artefacts, the relative effect of these artefacts on speaker recognition performance is analysed next. We show that speech attenuation has a larger influence on performance than noise modulation at typical SNR values. We therefore conclude that preserving speech, even at the cost of lower noise suppression (and, consequently, larger noise modulation), is beneficial to speaker recognition. This conclusion is further validated.
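The two masks named in the abstract can be illustrated with a minimal NumPy sketch. It uses the common textbook definitions, which are an assumption here since the abstract does not give formulas: the ideal binary mask is 1 in time-frequency bins where the local SNR exceeds a threshold (0 dB by default) and 0 elsewhere, and the ideal Wiener filter is the ratio of speech power to speech-plus-noise power. The function names, the `threshold_db` parameter, and the random toy spectra are hypothetical and not taken from the paper.

```python
import numpy as np

EPS = 1e-12  # guards against division by zero and log of zero


def ideal_binary_mask(speech_spec, noise_spec, threshold_db=0.0):
    """Return 1.0 where the local SNR exceeds threshold_db, else 0.0.

    Assumed textbook definition; the paper's exact threshold is not
    stated in the abstract.
    """
    s_pow = np.abs(speech_spec) ** 2
    n_pow = np.abs(noise_spec) ** 2
    snr_db = 10.0 * np.log10((s_pow + EPS) / (n_pow + EPS))
    return (snr_db > threshold_db).astype(float)


def ideal_wiener_filter(speech_spec, noise_spec):
    """Return the gain |S|^2 / (|S|^2 + |N|^2) per time-frequency bin."""
    s_pow = np.abs(speech_spec) ** 2
    n_pow = np.abs(noise_spec) ** 2
    return s_pow / (s_pow + n_pow + EPS)


# Toy complex spectra (frequency bins x frames) standing in for the
# short-time spectra of clean speech and noise.
rng = np.random.default_rng(0)
shape = (8, 10)
speech = rng.standard_normal(shape) + 1j * rng.standard_normal(shape)
noise = 0.5 * (rng.standard_normal(shape) + 1j * rng.standard_normal(shape))
noisy = speech + noise

ibm = ideal_binary_mask(speech, noise)
iwf = ideal_wiener_filter(speech, noise)

# Both masks are applied multiplicatively to the noisy spectrum.
enhanced_ibm = ibm * noisy
enhanced_iwf = iwf * noisy
```

Both masks need the separate speech and noise spectra, which is what makes them "ideal": they are available only in controlled experiments, not in deployment. Raising `threshold_db` in the binary mask removes more noise but also discards more speech bins, which is one simple way to explore the attenuation versus noise-modulation trade-off the abstract discusses.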


Similar resources

Overview of speech enhancement techniques for automatic speaker recognition

Real world conditions differ from ideal or laboratory conditions, causing mismatch between training and testing phases, and consequently, inducing performance degradation in automatic speaker recognition systems [1]. Many strategies have been adopted to cope with acoustical degradation; in some applications of speaker identification systems a clean sample of speech, prior to the recognition sta...


Noise-robust speaker recognition based on morphological component analysis

Speaker recognition suffers severe performance degradation under noisy environments. To solve this problem, we propose a novel method based on morphological component analysis. This method employs a universal background dictionary (UBD) to model common variability of all speakers, a speech dictionary of each speaker to model special variability of this speaker and a noise dictionary to model va...


An Investigation of Spoofing Speech Detection Under Additive Noise and Reverberant Conditions

Spoofing detection for automatic speaker verification (ASV), which aims to discriminate between live and artificial speech, has received increasing attention recently. However, previous studies have been carried out on clean data without significant noise. It is still not clear whether spoofing detectors trained on clean speech can generalise well under noisy conditions. In this work, we pe...


Spoofing detection under noisy conditions: a preliminary investigation and an initial database

Spoofing detection for automatic speaker verification (ASV), which aims to discriminate between live speech and attacks, has received increasing attention recently. However, all previous studies have been carried out on clean data without significant additive noise. To simulate real-life scenarios, we perform a preliminary investigation of spoofing detection under additive noisy conditions,...


Speaker Recognition Based on i-vector and Improved Local Preserving Projection

In this paper, an improved local preserving projection algorithm is proposed in order to enhance the recognition performance of the i-vector speaker recognition system in unpredicted noise environments. First, the non-zero eigenvalue is rejected when solving the optimal objective function, and only values greater than zero are used. A mapping matrix is obtained by solving a generalized eigenv...



Publication date: 2014